A Karaka Based Approach to Parsing of Indian Languages

Authors

  • Akshar Bharati
  • Rajeev Sangal
Abstract

A lexicalised grammar formalism has been developed that allows constraints to be specified between 'demand' and 'source' words (e.g., between a verb and its karaka roles). The parser has two important novel features: (i) It has a local word grouping phase in which word groups are formed using 'local' information only. They are formed based on finite state machine specifications, thus resulting in a fast grouper. (ii) The parser is a general constraint solver. It first transforms the constraints to an integer programming problem and then solves it.
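As a rough illustration of feature (ii), the sketch below casts a toy demand/source assignment as a 0-1 integer program and solves it by exhaustive enumeration rather than with a real integer programming solver. Everything in it is an assumption made for illustration and is not taken from the paper: the one-verb lexicon, the marker sets, the function names, and the exact form of the constraints (each role of a demand word filled exactly once, each source word group used exactly once).

```python
from itertools import product

# Hypothetical toy data (not from the paper): one demand word (a verb) whose
# karaka roles each permit certain case markers, and two source word groups
# with the marker each one carries ("" means no marker).
DEMANDS = {"khaa": {"karta": {"ne", ""}, "karma": {"", "ko"}}}
SOURCES = {"raam": "ne", "phal": ""}


def candidate_variables(demands, sources):
    """Create one 0-1 variable per (verb, role, source) triple the lexicon permits."""
    return [
        (verb, role, src)
        for verb, roles in demands.items()
        for role, markers in roles.items()
        for src, marker in sources.items()
        if marker in markers
    ]


def is_feasible(variables, values, demands, sources):
    """Check the assumed linear equality constraints for one 0-1 assignment."""
    chosen = [var for var, x in zip(variables, values) if x == 1]
    # Assumed constraint 1: every role of every demand word is filled exactly once.
    for verb, roles in demands.items():
        for role in roles:
            if sum(1 for v in chosen if v[0] == verb and v[1] == role) != 1:
                return False
    # Assumed constraint 2: every source word group fills exactly one role.
    for src in sources:
        if sum(1 for v in chosen if v[2] == src) != 1:
            return False
    return True


def solve(demands, sources):
    """Enumerate all 0-1 assignments and keep the feasible ones (toy sizes only)."""
    variables = candidate_variables(demands, sources)
    solutions = []
    for values in product((0, 1), repeat=len(variables)):
        if is_feasible(variables, values, demands, sources):
            solutions.append(sorted(var for var, x in zip(variables, values) if x))
    return solutions


if __name__ == "__main__":
    for parse in solve(DEMANDS, SOURCES):
        print(parse)
```

Enumeration is exponential in the number of variables, so it only suits toy inputs; it is used here to keep the 0-1 formulation visible without depending on a solver library.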

Similar resources

Parsing Free Word Order Languages in the Paninian Framework

There is a need to develop a suitable computational grammar formalism for free word order languages for two reasons: First, a suitably designed formalism is likely to be more efficient. Second, such a formalism is also likely to be linguistically more elegant and satisfying. In this paper, we describe such a formalism, called the Paninian framework, that has been successfully applied to Indian ...

On minimal realization of IF-languages: A categorical approach

The purpose of this work is to introduce and study the concept of a minimal deterministic automaton with IF-outputs which realizes the given IF-language. Among two methods for construction of such an automaton presented here, one is based on Myhill-Nerode's theory while the other is based on derivatives of the given IF-language. Meanwhile, the categories of deterministic automata with IF-outputs and ...

Chapter 76 Dependency Parsing in Bangla

A grammar-driven dependency parsing has been attempted for Bangla (Bengali). The free-word order nature of the language makes the development of an accurate parser very difficult. The Paninian grammatical model has been used to tackle the free-word order problem. The approach is to simplify complex and compound sentences and then to parse simple sentences by satisfying the Karaka demands of the...

An Affinity Based Greedy Approach towards Chunking for Indian Languages

A robust chunker can drastically reduce the complexity of parsing natural language text. Chunking for Indian languages requires a novel approach because of the relatively unrestricted order of words within a word group. A computational framework for chunking based on valency theory and feature structures has been described here. The paper also draws an analogy of chunk formation in free word ...

An Annotation Scheme for English Language using Paninian Framework

This paper presents a comprehensive study of Panini's karaka relations for English. The Paninian framework is suitable for all Indian languages, but some issues arise when it is applied to English. This paper discusses these issues and the different approaches that have been used in the past.

Publication date: 1990